A Dynamic Spoken Dialogue Interface for Ambient Intelligence Interaction
نویسندگان
چکیده
In this article, we present the interpretation and generation processes of a spoken dialogue interface for ambient intelligence. The interface is automatically created for each specific environment and the interpretation and generation vary depending on the environment and its context. These processes rely on a dialogue tree structure. Several modules process the tree structure and the context information to produce specific dialogues for the current environment state. The interface has been implemented and evaluated in an ambient intelligence environment. Satisfactory objective and subjective evaluation results are shown at the end of the article. DOI: 10.4018/jaci.2010010103 International Journal of Ambient Computing and Intelligence, 2(1), 24-51, January-March 2010 25 Copyright © 2010, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. On the other hand, some projects try to obtain more natural forms of communication to integrate them with the environment. That is the case of Aire (Adler and Davis, 2004), which has studied the possibilities for combining sketching with speech for multimodal design. Another project that has explored the use of speech for interacting with the environment is Homey (Milward & Beveridge, 2004). This project aims to carry out research on an intelligent dialogue interface designed to develop a dialogue between a tele-medicine interface and a patient. Considering the environment characteristics, this dialogue interface requires dynamic adaptation. Furthermore, the interaction can be multimodal. One of the main contributions to this field was the project Smartkom (Wahlster, 2006). This interface recognized speech or gestures and generated text, graphics or speech. Users could employ any of these modalities in three different scenarios: at home or in the office, at a communications booth and on the move with mobile devices. In this article we present a Spanish spoken dialogue interface for ambient intelligence environments. A dialogue control structure is automatically created according to the specific environment and it allows to interact with the environment and control its devices by means of spoken language interaction. Adaptation occurs at the interface creation and interaction processes. In both cases the interface and its behaviour automatically vary depending on the environment and its state. Contextual information obtained from the environment is employed to assist dialogue processes such as simple pronominal anaphora resolution, sentence interpretation, or recognition error recovering. The article is organized as follows. In Section 2 we introduce the concept of spoken dialogue interfaces in ambient intelligence environments. Section 3 presents the implemented ambient intelligence environment. We provide an overview of the environment representation in Section 4 and of the dialogue representation in Section 5. In Section 6 we give a more concise description of the interpretation and generation algorithms. Section 7 provides real examples of interaction. The interface evaluation is explained in Section 8. Finally we give some conclusion in Section 9. 2. dIALoGUE InTErFACES In AMBIEnT InTELLIGEnCE EnvIronMEnTS Although the presence of sound is not an essential characteristic for an ambient intelligence environment, we consider that a spoken dialogue interface is an important aspect in the development of these environments. Speech is a common, spontaneous and simple mean of communication (Clark & Brennan, 1991). This way, although it cannot always be the best input mechanism, it is a powerful method for the development of person-computer communication environments (Karat et al., 1999). This kind of interaction provides ambient intelligence environments with a more natural and intuitive way of communication. A continuous interaction in a daily occupied highly interactive environment without the possibility of using the voice could be a considerable effort and decrease significantly the capabilities of its occupants. Moreover, a field research study carried out with real subjects to know their expectations about ambient intelligence environments shows that people prefer to employ their voice to control the home devices and, when they can choose between different modalities, they mainly choose oral communication (Brumitt & Cadiz, 2001). Most of the spoken dialogue interfaces developed so far have focussed on the desktop classic environment or telephone-based agents for bank assistance, route planning or ticket reservation. These approaches have to be modified in the context of an ambient intelligence environment, where the interaction is addressed to a heterogeneous set of physical devices. Another differential factor for these interfaces is established by its idiosyncrasy. They are highly dynamic spaces whose configurations 26 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/article/dynamic-spoken-dialogueinterface-ambient/40348?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Computer Science, Security, and Information Technology. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=2
منابع مشابه
A Proposal for an XML Definition of a Dynamic Spoken Interface for Ambient Intelligence
Environments based on ambient intelligence require new interfaces that allow a natural interaction. The development of these interfaces has to be done in a standard way, considering the dynamic characteristics of these environments. In this paper we present the results in the development of an intelligent environment and a description language for the automatic generation of a spoken dialogue i...
متن کاملTowards ubiquitous task management
In the near future people will be surrounded by intelligent devices embedded in everyday objects where the knowledge and understanding of device attributes and capabilities will be a key enabler. This paper describes the current state of our research in design distributed knowledge based devices as a solution to adapt spoken dialogue systems within ambient intelligence. In this context a spoken...
متن کاملNew Directions in Spoken Dialogue Technology for Pervasive Interfaces
Spoken dialogue technology has emerged over the past decade as a challenging area for researchers in artificial intelligence, speech and language processing, and humancomputer interaction. At the same time a number of leading players in the computing industry have been looking seriously at the commercial potential of interactive spoken and multimodal systems. This paper considers the challenges...
متن کاملA Study of the Use of a Virtual Agent in an Ambient Intelligence Environment
In this paper we present the results in the evaluation of the use of a virtual agent together with a spoken dialogue system in an ambient intelligence environment. To develop the study, 35 different subjects had to perform eight different tasks and fill in a questionnaire with their impressions. From the answers we can conclude that the use of the virtual agent does not provide an improvement i...
متن کاملSystem Architectures for Speech-based and Multimodal Pervasive Computing Applications
Speech-based and multimodal interaction can be very efficient and natural way for human-computer communication in pervasive computing settings. The key features in these settings are the distributed and adaptive nature of interaction. In order to implement applications efficiently the system architecture must support these features. In this paper we discuss the requirements for speech-based per...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IJACI
دوره 2 شماره
صفحات -
تاریخ انتشار 2010